# Transformer encoder

| Model | Org | License | Description | Tags | Downloads | Likes |
|---|---|---|---|---|---|---|
| Dinov2 Giant | facebook | Apache-2.0 | A Vision Transformer trained with the DINOv2 method for self-supervised image feature extraction. | Image Classification, Transformers | 117.56k | 41 |
| Dinov2 Base | facebook | Apache-2.0 | A Vision Transformer trained with the DINOv2 method, extracting image features through self-supervised learning. | Image Classification, Transformers | 1.9M | 126 |
| Vit Large Patch32 384 | google | Apache-2.0 | A Vision Transformer (ViT) pre-trained on ImageNet-21k and fine-tuned on ImageNet, suitable for image classification. | Image Classification | 118.37k | 16 |
| Vit Huge Patch14 224 In21k | google | Apache-2.0 | A Vision Transformer pretrained on ImageNet-21k, with an extra-large architecture suited to visual tasks such as image classification. | Image Classification | 47.78k | 20 |
| Ruroberta Large | ai-forever | | A Russian RoBERTa-large model (355 million parameters) pre-trained by the SberDevices team on 250 GB of Russian text. | Large Language Model, Transformers, Other | 21.00k | 45 |
| Vit Large Patch32 224 In21k | google | Apache-2.0 | A Vision Transformer (ViT) pre-trained on ImageNet-21k, suitable for image classification. | Image Classification | 4,943 | 1 |
| Vit Large Patch16 384 | google | Apache-2.0 | A transformer-based image classification model pre-trained on ImageNet-21k and fine-tuned on ImageNet. | Image Classification | 161.29k | 12 |
| Vit Large Patch16 224 In21k | google | Apache-2.0 | A Vision Transformer pretrained on ImageNet-21k, suitable for image feature extraction and downstream fine-tuning. | Image Classification | 92.63k | 26 |